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Abstract: Using computer methods for database search and 
multiple alignment, statistically significant sequence similarities 
were identified between several nitrilases with distinct substrate 
specificity, cyanide hydratases, aliphatic amidases, jS-alanine 
synthase, and a few other proteins with unknown molecular 
function. All these proteins appear to be involved in the reduc- 
tion of organic nitrogen compounds and ammonia production. 
Sequence conservation over the entire length, as well as the sim- 
ilarity in the reactions catalyzed by the known enzymes in this 
family, points to a common catalytic mechanism. The new fam- 
ily of enzymes is characterized by several conserved motifs, one 
of which contains an invariant cysteine that is part of the cata- 
lytic site in nitrilases. Another highly conserved motif includes an 
invariant glutamic acid that might also be involved in catalysis. 

Keywords: enzyme evolution; homology search; nitrogen 
metabolism 



Utilization of nitrogen compounds usually involves several 
reduction steps. The final pathway frequently is the NH 4 + as- 
similation or transfer of NH 4 + to various intermediates such as 
keto acids (for review, see Willison, 1993, and references 
therein). We report here significant sequence conservation 
among numerous enzymes that are involved in nitrogen metab- 
olism and that cleave nitriles as well as amides (Fig. 1; Table 1). 
Based on their sequence similarity and on the reactions they cat- 
alyze, these enzymes can be classified into 6 functionally distinct 
groups. 

1 . Nitrilases cleave various nitriles into the corresponding 
acids and ammonia. The family described here includes en- 
zymes with specific substrates such as bromoxynitrile, 
arylacetone, and indole-3-acetonitrile, but also wide- 
spectrum nitrilases (e.g., acting on aromatic or aliphatic 
nitriles). Prokaryotic nitrilases are mainly involved in uti- 
lization of various nitrogen sources (Nagasawa et al., 1990; 
Kobayashi et al., 1992); plant nitrilases participate in the 



Reprint requests to: Peer Bork, European Molecular Biology Labo- 
ratory, Meyerhofstrasse 1, 691 17 Heidelberg, Germany; e-mail: bork@ 
embl-heidelberg .de . 



production of the hormone indole-3-acetic acid (Kobayashi 
et al., 1993). 

2. Cyanide hydratase of pathogenic fungi detoxifies HCN 
that is released by their hosts, cyanogenic plants, after in- 
jury. Its sequence is very similar to those of nitrilases 
(Wang & Van Etten, 1992) and is actually more closely re- 
lated to prokaryotic nitrilases than the latter are to eukary- 
otic ones (Fig. 1). 

3. Aliphatic amidases enable prokaryotes to use acetamides 
as both carbon and nitrogen source (see Soubrier et al., 
1992, and references therein). 

4. jS- Alanine synthase (7V-carbamoyl-/3-alanine amido hydro- 
lase) catalyzes the last step of pyrimidine catabolism; the 
resulting j8-alanine can be further converted into pyruvate 
but can also be incorporated into various pathways. In ad- 
dition to the characterized enzyme from rat (Kvalnes-Krick 
& Traut, 1993), our database search revealed a closely re- 
lated (5Q°/o sequence identity over 98 residues) expressed 
sequence tag (EST) from Caenorhabditis elegans (EMBL 
accession no. Z 14933). 

5. AdgA (for ammonia-dependent growth) from Rhodobac- 
ter species appears to be essential for using various amino 
acids as nitrogen sources; mutations in this gene lead to 
the requirement for ammonia in the medium for growth 
(Willison, 1993). AdgA is the only protein of this sequence 
family with an additional domain (Willison, 1993), a pu- 
tative ATP pyrophosphatase (data not shown). 

6. An open reading frame (ORF5) in the agr-operon of 
Staphylococcus species (Vandenesch et al., 1993; J. Korn- 
blum, S.J. Projan, B.N. Kreiswirth, S.L. Mogazeh, W. 
Eisner, H. Ross, R.P. Novick, unpubl.: EMBL accession 
no. X52543; Agr5 in Fig. 1) comprise yet another distinct 
group and probably possess an as yet uncharacterized en- 
zymatic activity. 

The sequence similarity within the 6 groups is very high, 
whereas it ranges only between 12 and 24<7o amino acid identity 
between the groups (Fig. 1). Database searches performed using 
the Blastp program (Altschul et al., 1990) indicated a very low 
probability (P-value) of matching by chance for nitrilases ver- 
sus cyanide hydratases (below KT 20 ) and for ORF5 in the agr- 
operon versus aliphatic amidases (below 10" 8 ). The P-value of 
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Table 1. Summary of the carbon-nitrogen hydrolase family 



Enzyme 



EC 
number 



Identical 
subunits 



Chemical reaction 



References 



0-Alanine synthase 
Cyanide hydratase 
Aliphatic amidase 
Nitrilases 
AdgA 



EC 3.5.1.6 
EC 4.2.1.66 
EC 3.5.1.4 
EC 3.5.5.1 



Hcxamer 
Mul timer 
Tetramer 
Hcxamer, dimer 



/V-carbamoyl-tf-alanine + H 2 0 » ^-alanine + CO2 + NH 3 

Hydrogen cyanide + H2O = formamide (CHO-NH2) 

Monocarboxylic acid amide + H 2 0 = monocarboxylate + NH3 
Nitrile + H 2 0 = carboxylate + NH 3 

X-NH 2 + ATP + H z O *= Y + AMP + PP + NH 3 



Kvalnes-Krick & Traut, 1993 
Clunesset al., 1993 
Soubrier et al.. 1992 
Ko bay as hi ct al., 1992 
Willison, 1993 



7.7 x 10~ 4 was observed for the alignment between 0-alanine 
synthase and one of the fungal cyanide hydratases. In contrast, 
sequences from other subsets of the family (Fig. 1) showed only 
very low, not statistically significant, similarity to each other in 
these initial searches. Nevertheless, the fact that the segments 
detected by Blastp included the same regions that are most highly 
conserved in each of the subsets prompted further investigation 
of these marginal similarities. 

Multiple alignment with the ClustalW program (Higgins et al., 
1992; D. Higgins, J. Thompson, &T. Gibson, unpubl.) revealed 
an overall similarity of all proteins (Fig. 1). The alignment was 
confirmed with the MACAW program (Schuler et al., 1991), 
which calculated for 3 blocks (overlined in Fig. 1) a probabil- 
ity of matching by chance below 10~ 7 (Fig. 1). For the 2 most 
conserved motifs (double overlined in Fig. 1), property pattern 
database searches were carried out (PROPAT program; Rohde 
& Bork, 1993). We found that the patterns were unique for this 
family, i.e., no additional sequences with comparable level of 
similarity to the conserved patterns were detected. 

The conserved blocks span almost the entire length of the pro- 
teins in the new family (with the only exception of AdgA that 
contains an additional domain), which is highly suggestive of 
a common topology. This is supported by secondary structure 
predictions (Rost & Sander, 1993) that give similar results for 
different groups (Fig. 1). 

Although one has to be very cautious in transferring func- 
tional information between distantly related sequences (Bork 
et al., 1994), a few generalizations can be made. One of the con- 
served motifs shared by all the proteins of the family contains 
an invariant cysteine (Fig. 1). This residue has been shown to 
be involved in the active site of very different nitrilases 
(Kobayashi et al„ 1992, 1993). It appears likely that this cysteine 
plays the same role in all members of this family of enzymes. 
The most conserved motif (the probability of chance occurrence 
is 7.9 x 10" 18 as calculated using the MACAW algorithm) is lo- 
cated toward the N-terminus of the proteins. It contains an in- 
variant glutamic acid that follows a very hydrophobic, probably 
interior 0-strand (Fig. 1). The glutamic acid residue might thus 
be another direct participant in the cleavage reaction. 

An interesting feature of the family described here is the 
strong hydrophobicity within the conserved regions. The hydro- 
phobicity might be due to subunit contacts and indeed all of the 
characterized proteins appear to form multimers (Table 1). An 
alternative explanation is a tightly packed 0-sheet with the hy- 
drophobic j3-strands located in the core and shielded from the 
protein surface. 

The family described here consists of functionally diverse 
proteins such as nitrilases, amidases, cyanide hydratase, and 



^-alanine synthase. On the other hand, sequences are avail- 
able for several other nitrilases and amidases that appear to be 
completely unrelated to the members of the family delineated 
here. Thus, enzymes involved in the metabolism of various 
organic nitrogen compounds seem to be yet another example 
of convergent evolution of proteins toward similar enzymatic 
activity. 
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